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Claim 
. . . and 

a queuing structure coupling the first level cache and the second level 

cache . 



2 The cache memory of claim 1, wherein the first level cache and the 
second level cache are adapted to simultaneously receive a 64 bit virtual 
address at each one of the first and second address ports, respectively. 

The cache memory of claim 2, wherein the first address port and the 
second address port for the second level cache simultaneously receive a 
[irst virtual address and a second virtual address. 

^^The cache memory of claim 1, wherein the first level cache is 
^adapted to contain only integer data, and wherein the second level 
cache is adapted to include integer and floating point data. 

5 The cache memory of claim 1, wherein the first address port for the 
first level cache and the first address port for the second level 
cache are adapted to simultaneously receive a first virtual address, 
and wherein the first level cache and the second level cache are 
adapted to initiate a cache lookup for the first virtual address in a 
first clock cycle. 

6 The cache memory of claim 5, wherein the first level cache memory is 
adapted to complete the cache lookup for the first virtual address in a 
rlrsi: clock cycle, and wherein the queuing structure is adapted to signal 
i j tirst level cache hit /miss for the first virtual address to the second 
level cache after a second clock cycle. 

16 
LI 

:Suisudwoo Xiouiguioqovo diqo-uo oqj'Xiomaw oqoeo diqo-uo uv 
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:Suisudiuoo di o...a first cache hit/miss signal corresponding to the 
first virtual address through a queuing structure to an arbitrator in the 
second level of the cache memory after a second processor clock cycle. 

18 The method of claim 17, wherein providing a first cache hit/miss 
signal corresponding to the first... 



ADDRESS IN A FIRST LEVEL OF SECOND VIRTUAL ADDRESS IN 
THE CACHE MEMORY. . . 
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. . . walk logic after invoking the second 

level access logic (or vice versa) , if an access 
request signal and a translate request signal are both 
received simultaneously or are pending concurrently . 

The second level access logic also performs 
housekeeping functions, such as servicing a snoop 
request signal and flushing a table entry from the 
second level cache line to main memory. 

In one embodiment, the two logics of the second 
level control unit are both formed on a single 
.-.icroprocessor chip that includes two first level 
caches: an instruction cache and a data cache . The 
instruction cache includes an instruction word cache 
that holds instructions and an instruction TLB that 
holds table entries for translation of instruction 
addresses. The data cache includes a data word cache 
that holds data operands and a data TLB that holds 
table entries for translation of data addresses. The 
two word caches and the two TLBs... 



11/3, K/37 (Item 4 from file: 349) 

DIALOG(R) File 349: PCT FULLTEXT 

(c) 2004 WIPO/Univentio. All rts. reserv. 

00336800 **Image available** 

METHOD AND CIRCUITRY FOR EXECUTING READ COMMANDS AND MODIFYING DATA WORDS 

IN A MULTI-LEVEL DISTRIBUTED DATA PROCESSING SYSTEM 
PROCEDE ET CIRCUITS D' EXECUTION DE COMMANDES DE LECTURE ET DE MODIFICATION 

DE MOTS DE DONNEES DANS UN SYSTEME INFORMATIQUE DISTRIBUE MULTI-NIVEAUX 

Pa tent Appl lean t /Assignee : 

UNISYS CORPORATION, 
i n vent or ( s ) : 

CASTLE David Edgar, 

DONLEY Greggory Douglas, 

FLORA Laurence Paul, 

MA NO J Manoj, 



I 

.:ASSONE Brian Joseph, 
Ivjr-.er.i: and Priority Information (Country, Number, Date) : 
Patent: WO 9727543 Al 19970731 

Application: WO 97US1122 19970124 (PCT/WO US9701122) 

Priority Application: US 96591843 19960125; US 96591844 19960125 

Designated States: JP AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE 

Publication Language: English 

Full text Word Count: 11280 

FuJ itext Availability: 
Detai led Description 

English Abstract 

A two port high level cache memory for use in a multi-level distributed 
data processing system includes: 1) a first tag-memory which receives 
READ commands through one port from a system bus; and 2) a second 
tag-memory which receives READ commands through another port from a 
processor bus. The two tag memories store identical sets of comparable 
addresses. With two tag memories the high level cache memory is 
able to respond immediately to two different READ commands which occur 
concurrently on the processor bus and the system bus. Also, a 
multi-level distributed data processing system includes: 1) a system bus 
(13) having a main memory (11) coupled thereto; 2) multiple high level 
cache memories (14), each of which has a first port coupled to the 
system bus (13) and a second port coupled to a respective processor bus 



Detailed Description 

in the entire Pentium Pro system. 

However, a problem which needs to be addressed in the 

above multi-level distributed data processing system is that each high 
level cache memory preferably should be able to respond quickly and 
simultaneously to two different READ commands, one of which occurs 
on a processor bus and the other of which occurs on the system bus. If 
the READ command on the processor bus is for a data word which is 
^stored in the high level cache memory, then the high level cache 
memory preferably should present that data word on the processor 
bus quickly in order to enhance system performance. At the same 
time, if the READ . . . 



11/3, K/38 (Item 5 from file: 349) 

DIALOG (R) File 34 9:PCT FULLTEXT 

(c) 2004 WIPO/Univentio. All rts. reserv. 

00358736 **Image available** 

SYSTEM AND METHOD FOR PROCESSING MULTIPLE REQUESTS AND OUT OF ORDER RETURNS 
SYSTEME ET PROCEDE DE TRAITEMENT DE DEMANDES MULTIPLES ET DE RETOURS 
DESORDONNES 

:'v c 3 r ; - App 1 i c a n t / As s i g nee : 
YO.; SYSTEMS INC, 
■ f ' s ; : 

' . •' l.o Trong , 

WARA Yasua ki, 

Pvorit and Priority Information (Country, Number, Date): 
Patent: * WO 964 1250 A2 19961219 

Application: WO 96US8549 19960603 (PCT/WO US9608549) 

Priority Application: US 95479035 19950607 
Designated States: JP KR AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE 
Publication Language: English 
Fulltext Word Count: 8656 

F..3.1 lte:-:t Availability: 
Het a i led Desc ription 

Dei ailed Description 

READTWRITE port allows reading and 
writing of appropriate block-frame addresses in TAG RAM 302 and the 



corresponding data in S-RAM 304, while the READ port allows snooping 

for 

synchronizing multiple processor operations and bus coherency 

FIG. 3 further depicts a two - level memory-hierarchy. As is 
well-known, the upper cache level allows quicker retrieval of data 
stored in the cache S-RAM 304 compared to the greater storage capacity 
but slower retrieval time of data stored in the main memory level in 
memory 112. Such... 
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Detailed Description 

from the shared level 2 cache 30 to the first level I cache 20, as in 
step 207, the inclusion field in the shared level cache 30 is updated 
(to reflect that the first level 1 cache 25 now contains the 
tag-address corresponding to the required information), as in step 208, 
■j. r :-i - r hf-' requested information is sent to the first bus master 10, as in 

.:('?. The sending of the information from the shared level 2 
cache to the first level I cache 25 can be done concurrently 

A'l-r. }-h« sending of the information from the shared level 2 cache 
'. tc 1 he fcirst bus master 10 that made the memory read access 

1 £ a memory read access is made by a bus master that does not utilize 
the shared level 2 cache 30, such as the third bus master 55 as shown 
in Figure 1, then the procedure is a little different than that explained 
above . The . . . 

Claim 

2 cache to determine if said level 2 cache contains a tag-address 
that coincides with said 

memory access address; 

(1) if said level 2 cache does not contain any tag-address that 
coincides with said memory access address, obtaining said information 
corresponding to said memory access address from said main... for said one 
storage location within said level 2 cache indicates that said 
tag-address does not reside in any of said plurality of level 1 caches 
connected to said level 2 cache; 

(1) said one of said level I caches copying said information 
13 

corresponding to said coincident tag-address from said level 2 cache 
into an available one of said plurality of storage 
locations within said one of said level I caches; 



(2) said one of said level 1 caches sending said information 

residing 

in said available one of said plurality of storage locations within said 
one of said level 1 caches to said one of said bus 
masters ; 

(3) said level 2 cache updating said inclusion field corresponding to 
said one of said plurality of storage locations within said level 2 
cache which has said coincident tag-address stored therein to indicate 
that said one of said level 1 caches contains said coincident 
rag-address; (b) if said inclusion field for said one storage location 
within said level 2 cache indicates that said tag-address ... 
another of said level I caches sending said information 
corresponding to said coincident tag-address to said level 2 cache; 

(2) said level 2 cache retrieving said information corresponding to 
said coincident tag-address from said another of said level I caches 
and copying said information into an available one 

of said plurality of storage locations within said one level 2 
cache ; 

(3) said one of said level I caches copying said information 
corresponding to said coincident tag-address from said level 2 
cache into an available one of said plurality of storage 
locations within said one of said level 1 caches; 

(4) said one of said level I caches sending said information 
corresponding to said coincident tag-address to said one of said bus 
masters; and 

(5) said level 2 cache updating said inclusion field corresponding to 
said one of said plurality of storage locations within said level 2 
cache to indicate that said one of said level I caches contains said 
coincident tag-address stored therein. 

13. A method as in claim 12, comprising the further step of 
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Claim 

. . . output device 

and the associated prefetch pointer is unmodified in main memory 
when a line of data is written by a processor device. 



1.1 The cache memory system as set forth in claim 1, 

wherein the cache memory system comprises a first level cache and a 



prefetching second level cache, said second level cache operating 

in 

parallel with the first level cache between the requesting device 

and 

main memory. 

12 The cache memory system as set forth in claim 1, 

wherein the cache memory system comprises a first level cache and a 
prefetching second level cache, said second level cache operating 

in 

series between the first level cache and main memory. 

13 The cache memory system as set forth in claim 1, 
wherein the prefetch pointer is stored in parity fields... 

..prefetch pointer is stored in a region of main memory 
separate from the line of data with which the prefetch pointer is 
associated . 

\ T:.o cache memory system as set forth in claim 13, 
• rvi:. EDC .information is further stored in the parity fields located 

, • ■ : - ; ir.j; of data. 

1 i. Ls written by the processor. 

59 The processing system as set forth in claim 48, wherein if 

the prefetch pointer comprises a Null value, said cache controller 
causing the next sequential line in memory to be prefetched. 

60 The processing system as set forth in claim 48, wherein 
the cache memory system comprises a first level cache and a 
prefetching second level cache, said second level cache operating 
in 

parallel with the first level cache between the processor and main 
memory. - 4 5 

61 The processing system as set forth in claim 48, wherein 

the cache memory system comprises a first level cache and a 
prefetching second level cache, said second level cache operating 
in 

series between the first level cache and main memory. 

62 The processing system as set forth in claim 48, 

comprising an instruction cache for storing lines of data comprising 
instructions and a data cache for storing lines of data comprising other 
information requested, each cache maintaining independent prefetch 

pointers . 

6'J The processing system as set forth in claim 48, wherein 
r.he cache is adirect mapped cache. 

64 The processing system. . . 
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Dec ailed Description 

. . . below to functional units will normally be understood to mean elements 

CPU 10, not such external devices. 

■V. i :.s r. ruCt ion Decoder (DEC) 12 performs instruction fetch , instruction 
:.-.\-)de, and pipeline control. DEC 12 optionally interleaves instruction 
prefetch of up to three simultaneous instruction streams. DEC 12 
contains a two - level Branch Prediction Cache (BPC) 13. lie BPC 
includes an integrated structure which contains dynamic branch history 
data, a physical branch target address, and a branch target buffer for 
each cache entry. As branch instructions are decoded, the BPC is 
consulted for information about that branch. Independent of the direction 
predicted, branches are executed in a... history bits, and 24 target 
instruction bytes with 3 target instruction valid (TIV) bits. Second 
level BPC 155 is a deep but narrow structure which caches only partial 
prediction information but for a much larger number of branch 
instructions. Second level BPC 155 contains 256 entries, each containing 
two bytes of partial target address information and one history bit. 

In parallel with instruction decoding, the instruction's decode PC is 
used to perform parallel lookups in the first and second level 

BPCs. (Since the incoming instructions have not been decoded at this 
point, non-branch instructions are also checked ). In the event of a hit 
on first level BPC 152, the target instruction bytes are communicated 
to instruction decoder 160, the branch history bits are communicated to 
I DC 162, and the target address ... bad address prediction is no worse than 
no prediction. Further, the occurrence of such bad predictions 
corresponds to the miss rate of the second level cache and thus is 
reasonably low given the relatively large number of cache entries. This 
is also enhanced by applying the hardware savings from eliminating the 
raq storage to increasing the number of cache entries. 

f )pe i a r. ion 

A first level cache size of 36 entries and second level cache size of 
256 entries, in combination with a factor of 16 difference in per-entry 
cost, results in second-to- first level cache ratios of eight times 
the number of entries, yet still almost half the size. With this much 
larger size, even given the direct-mapped organization, the second 
level cache provides an effective backup to the first level cache. 

As each branch instruction is fetched its address is used to perform 
parallel look-ups in the two levels of BPC: the large-set or fully 
associative first level access using the full branch address; and the 
direct-mapped, tag-less second level using only a subset of the 
address bits for the index. 

If there is a tag match with a first level cache entry, then all of 
this entry's prediction information is read out, and the second level 
BPC is ignored. All the necessary predictions are made, effectively 
eliminating or hiding any delays in otherwise processing the branch 
instruction and in starting processing... 
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number of cache read or write operations in 
conjunction with the AU 10. As has been previously 
mentioned, the AU 10 determines the number of cache read or 
write cycles required for a particular operand as a function 
of the operand alignment, as indicated by VA<29:31>, and the 
length of the operand, as indicated by bits 8-15 of IRO as 
stored in LI / L2 98, The AU 10 also provides this information 
f.o CP 12 such that both units operate in tandem to read 
operands and store results in the CP 12 cache memory, This 
aspect of the invention will be further discussed below in 
relation to the flow chart of Fig. 5d, 
For unconditional branch type operations... 
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... : ■ :.e memory parts and each of the cache sets. The cache receives a 
i ,;.;k over the bus in two pairs of contiguous words. The cache memory 
is updated wich both words of a word pair simultaneously. The pairs of 
words are each stored simultaneously into locations of one of either of 
Lhe cache sets, each word into a location of a different memory part 
and of a different level . Cache hit check is performed on all 
locations of a level simultaneously . All locations of the checked 
level are accessed simultaneously , and the cache hit check is 
performed on all locations of the checked level simultaneously. 



